Corpus-based Extraction of Japanese Compound Verbs
نویسندگان
چکیده
We describe two methods for Japanese compound verb (JCV) extraction, based on synthesis and pattern matching over the Google Japanese n-gram corpus. We devise a number of filters to boost the precision of the corpus-based method, and evaluate the two methods based on a sample of JCVs occurring in varying frequency bands. We also investigate the distribution of JCV token frequency, and the type frequency of their components.
منابع مشابه
Automatic Extraction of Complex Predicates in Bengali
This paper presents the automatic extraction of Complex Predicates (CPs) in Bengali with a special focus on compound verbs (Verb + Verb) and conjunct verbs (Noun /Adjective + Verb). The lexical patterns of compound and conjunct verbs are extracted based on the information of shallow morphology and available seed lists of verbs. Lexical scopes of compound and conjunct verbs in consecutive sequen...
متن کاملConcordance-Based Data-Driven Learning Activities and Learning English Phrasal Verbs in EFL Classrooms
In spite of the highly beneficial applications of corpus linguistics in language pedagogy, it has not found its way into mainstream EFL. The major reasons seem to be the teachers’ lack of training and the unavailability of resources, especially computers in language classes. Phrasal verbs have been shown to be a problematic area of learning English as a foreign language due to their semantic op...
متن کاملVerbs in specialised corpora: from manual corpus-based description to automatic extraction in an English-French parallel corpus
This paper tackles the issue of verbs in specialised corpora in the view of term extraction. Corpus-based manual descriptions to be used in various applications have highlighted the “deviant” uses of verbs in specialised corpora compared with general uses as well as the need for verb extraction. However, very few attention has been given to verbs both in the terminology theory and automatic ter...
متن کاملCorpus based Semi-Automatic Extraction of Persian Compound Verbs and their Relations
Nowadays, Wordnet is used in natural language processing as one of the major linguistic resources. Having such a resource for Persian language helps researchers in computational linguistics and natural language processing fields to develop more accurate systems with higher performances. In this research, we propose a model for semi-automatic construction of Persian wordnet of verbs. Compound ve...
متن کاملA Disambiguation Method For Japanese Compound Verbs
The purpose of this study is to construct a semantic analysis method for disambiguating Japanese compound verbs. Japanese speakers produce a rich variety of compound verbs, making it difficult to process them by computer. We construct a method employing 110 disambiguation rules based on the semantic features of the first verb of a compound and syntactic patterns consisting of co-occurrence betw...
متن کامل